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This  report  was  prepared  for  the  Ecosystems  Research  Division,  National 
Exposure  Research  Laboratory,  U.S.  Environmental  Protection  Agency  (USEPA), 
Athens,  GA.  It  is  the  summary  of  the  first  part  of  the  Integrated  Electronic  Data 
Repository  (IEDR)  to  support  all  phases  of  the  multiyear  Technology  Required  for 
Alternative  Analyses  for  a  Changing  Environment  (TRACE)  Program.  The 
project  was  developed  by  the  Environmental  Laboratory  (EL),  U.S.  Army 
Engineer  Research  and  Development  Center  (ERDC),  Vicksburg,  MS,  and  Jaya 
Corporation,  Huntsville,  AL.  Ms.  Brenda  Kitchens,  USEPA,  has  served  as  the 
point  of  contact  between  ERDC  and  the  USEPA. 

Authors  of  this  report  were  Dr.  Linda  Peyman  Dove,  Environmental  Systems 
Branch  (EE-C),  EL,  ERDC,  and  Mr.  Steve  Bong,  Jaya  Corporation.  The  study 
was  under  the  general  supervision  of  Dr.  Rose  Kress,  EE-C,  and  Mr.  Harold  W. 
West,  Chief,  EE-C;  Dr.  Dave  Tazik,  Chief,  Ecosystem  Evaluation  and 
Engineering  Division;  and  Dr.  Edwin  A.  Theriot,  Acting  Director,  EL. 


At  the  time  of  publication  of  this  report,  Dr.  James  R.  Houston  was  Director 
of  ERDC,  and  COL  John  W.  Morris  III,  EN,  was  Commander  and  Executive 
Director. 

This  report  should  be  cited  as  follows: 

Dove,  L.  P.,  and  Bong,  S.  (2001).  “Integrated  electronic  data  to 
support  the  South  Fork  of  the  Broad  River  research  program,” 

ERDC/EL  TR-01-14,  U.S.  Army  Engineer  Research  and 
Development  Center,  Vicksburg,  MS. 


The  contents  of  this  report  are  not  to  be  used  for  advertising,  publication, 
or  promotional  purposes.  Citation  of  trade  names  does  not  constitute  an 
official  endorsement  or  approval  of  the  use  of  such  commercial  products 
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1  Background 


The  U.S.  Army  Engineer  Research  and  Development  Center  (ERDC), 
Vicksburg,  MS,  is  currently  developing  a  design  and  implementation  plan  for  an 
Integrated  Electronic  Data  Repository  (IEDR)  to  support  all  phases  of  the 
multiyear  Technology  Required  for  Alternative  Analyses  for  a  Changing 
Environment  (TRACE)  Program.  Specifically,  the  repository  is  needed  to  support 
the  Total  Maximum  Daily  Load  (TMDL)  field  research  projects  established  in  the 
South  Fork  of  the  Broad  River  watershed.  The  purpose  of  the  repository  is  to 
archive  and  disseminate  various  types  of  data  associated  with  the  project  to 
support  environmental  monitoring  and  modeling.  The  modeling  activities  include 
those  such  as  sediment  transport,  watershed  analyses,  and  environmental  analyses. 
These  models  require  data  gathered  ultimately  from  various  data  files  submitted 
and  cataloged  in  an  orderly  fashion  within  the  repository. 

This  report  summarizes  the  design  requirements  and  considerations  used  to 
develop  a  prototype  web  based  repository  site  for  the  TMDL  research  project.  It 
is  presented  in  two  parts:  design  requirements  and  considerations  and  prototype 
web-based  repository. 
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2  Design  Requirements  and 
Considerations 


The  following  summarizes  the  design  requirements  either  discussed  at  the 
24  August  2000  meeting  between  the  U.S.  Environmental  Protection  Agency 
(USEPA),  ERDC,  and  Jaya  Corporation,  or  developed  in  response  to  this  meeting. 
These  design  requirements  have  been  used  to  develop  a  prototype  web-based 
repository.  The  overall  concept  of  the  IEDR  is  presented  in  Figure  1. 


Figure  1 .  Integrated  electronic  data  repository  concept 


a.  Architecture :  The  server  architecture  to  support  the  repository  may  be 
centralized  or  decentralized  (distributed).  The  implemented  design 
should  be  one  that  best  serves  the  anticipated  data  and  data  warehousing 
requests.  The  system  will  be  designed  initially  as  an  intranet  application 
accessible  to  only  members  of  the  research  effort  or  project.  It  is 
estimated  that  the  number  of  users  will  not  be  greater  than  10  and  the 
number  of  concurrent  users  will  not  be  greater  than  2.  If  applicable,  the 
system  will  eventually  be  migrated  to  an  Internet  application. 

b.  Hardware  and  Software:  The  system  will  be  designed  and  implemented 
in  the  Microsoft  NT  4.0  operating  system.  The  IEDR  will  be 
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implemented  on  a  NT  4.0  Windows  Server  with  the  following  minimum 
hardware  requirements:  processor  speed  of  600  MHz,  memory  of 
256Mb,  tape  backup  capability.  The  repository  will  be  designed  for  the 
above  NT  Windows  platform  running  Internet  Information  Server  (IIS), 
Microsoft  FrontPage  2000,  Microsoft  Access,  Active  Server  Page  (ASP) 
upload,  and  Persits  server  software. 

Repository  users  will  be  required  to  use  a  personal  computer  running 
Microsoft  Explorer  4.0,  Netscape  4.75,  or  greater.  To  maximize  the 
repository’s  functionality,  users  will  be  required  to  have  e-mail  access  and 
an  e-mail  program  available  on  their  computer. 

c.  Data  Sources :  The  system  will  be  capable  of  storing  and  retrieving  files 
in  several  formats,  such  as  text,  spatial,  tabular/spreadsheet,  and  video/ 
multimedia  files.  Major  sources  of  data  include:  collected  field  data, 
laboratory  analysis,  research  analysis/reports,  and  spatial/ Geographic 
Information  System  (GIS)/imagery  data.  Examples  might  include: 

(1)  Field  Data 

•  Stream  data  for  the  six  study  sites 

•  Samples/data  to  be  collected  before  and  during  each  storm 
event  for  all  monitoring  sites 

•  Stream  flow  data  collected  between  storm  events 

•  Weather  station  data 

•  Additional  rain  gauges 

•  Stream  cross  sections 

•  Data  from  real  time  gauging  stations  at  Carlton  site 

•  Rating  curves  for  each  sampling  station 

(2)  Laboratory  Analysis 

•  Suspended  sediment  data 

•  Bedload  sediment  data 

•  Nutrient  data 

•  Bacteria  data 

•  Instrumentation  data 

(3)  Research  Analysis/Reports 

•  Correlation  of  pollutants  in  groundwater  from  sediment  data 
analyses 

•  Transport  mechanisms  of  pollutants  in  streams 

•  Riparian  zone  and  restoration 

(4)  Spatial/GIS/Imagery 

•  Digital  Orthophoto  Quadrangle  (DOQ) 

•  Satellite  Imagery 

•  GIS  Data  Layers 
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d.  Query  Capability:  Server  capability  will  support  search  and  retrieval  of 
data  stored  in  a  directory  tree  structure. 

Data  files  within  the  data  file  repository  should  be  organized  by  themes. 
Data  files  stored  in  the  repository  will  generally  fall  into  one  of  the 
following  themes  (categories).  These  themes  represent  an  initial  set  and 
may  be  expanded  as  the  repository  matures. 

•  Surface  Hydrology  Data  such  as  flow  and  discharge  data 

•  Water  Quality  (nutrients)  Data  such  as  nitrate,  phosphate 
content  data 

•  Watershed  Characterization  Data  such  as  site  characterization 
data 

•  Sediment  Data 

•  Documentation  or  Reports 

•  Pathogens  Data 

•  Stream  Data 

•  Meteorological/Weather  Data 

Although  data  will  be  classified  into  one  of  the  above  themes,  it  will 
normally  be  cross  referenced  to  a  specific  date,  monitoring  site,  keyword, 
contributor,  and/or  data  type.  This  will  provide  sufficient  cross- 
referencing  to  accommodate  the  user’s  search  requirements. 

e.  Quality  Control :  The  system  will  have  a  mechanism  that  will  check  all 
incoming  data  for  viruses.  The  system  is  not  required  to  automatically 
check  the  accuracy  of  data  content  being  submitted.  This  responsibility 
rests  with  the  contributor,  author  or  originator  of  the  data.  Each  incoming 
data  file  will  be  checked  to  ensure  that  the  submitted  data  include  a 
description  and  metadata  file. 

f  Security.  Several  levels  of  security  layers  for  accessing  the  site  and  data 
files  must  be  provided.  The  initial  system  will  be  restricted  to  use  by  a 
select  USEPA  user  group.  When  the  system  is  fully  implemented, 
consideration  will  be  given  for  limited  general  public  access. 
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3  Prototype  Web-Based 
Repository  Design 


Conceptual  Design 

A  prototype  web-based  repository  for  the  TMDL  project  at  South  Fork  of  the 
Broad  River  was  developed  and  can  be  viewed  at  http://206.166.205.174/.  This 
template  represents  a  preliminary  design  that  was  developed  as  part  of  the  Year  I 
Tasks  in  the  IEDR  to  Support  the  South  Fork  of  the  Broad  River  Research 
Program  Scope  of  Work. 

Users  will  interface  with  the  data  repository  using  the  Internet  and  specially 
designed  web  pages.  Users  will  be  required  to  enter  data  at  various  stages  of  their 
query,  download,  and  submittal  steps.  Maximum  use  will  be  made  of  dropdown 
menus  and  geographical  map  selection  (i.e.,  search  for  data  by  clicking  on  map 
points).  A  general  overview  of  the  conceptual  design  is  shown  in  Figure  2. 


Home  Page  Options 

Navigation  throughout  the  repository  will  begin  with  a  home  page.  Users 
visiting  the  data  repository  web  site  will  be  greeted  with  a  home  page  that  offers 
the  user  the  following  options: 

a.  Obtain  User  ID:  A  button  at  the  home  page  will  permit  a  first-time  user 
to  proceed  to  another  page  that  will  permit  an  individual  to  request  a  user 
ID  and  password.  Security  for  the  repository  requires  that  each  user 
obtain  a  valid  user  ID  and  password. 

b.  Contact  Information:  A  list  of  pertinent  names  and  numbers  for 
contacting  to  obtain  further  information  about  the  repository  are 
presented. 

c.  Enter  Repository:  A  selection  on  the  home  page  menu  will  permit  the 
user  to  proceed  to  the  page  where  he/she  can  log  onto  the  repository,  if 
the  user  has  previously  obtained  a  user  ID  and  password. 
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Figure  2.  Conceptual  Design  of  the  IEDR  at  the  web  page  level 

After  log  on,  the  users  shall  have  access  to  many  modules  within  the 
repository. 

a.  Browse  data :  Users  shall  be  permitted  to  select  a  file  for  downloading 
from  a  directory/subdirectory  listing. 

b.  Search  data:  The  user  will  be  able  to  search  the  repository  based  on 
categories/themes  for  a  listing  of  data  files  that  may  be  selected  for 
downloading.  Several  pages  will  be  developed  that  permit  the  user  to 
view  listings  of  files  contained  in  the  repository  based  on  a  criterion. 
Users  will  be  able  to  obtain  file  listings  based  on  the  following  criterion: 

(1)  Categories/themes 

(2)  Dates 

(3)  Data  types 

(4)  Contributors/authors 

(5)  Keywords 
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(6)  Monitoring  sites 

Users  may  also  select  a  page  that  permits  them  to  build  a  query  for  file 
listings  based  on  multiple  criteria. 

The  above  searches  shall  provide  the  user  with  a  listing  of  files  and 
amplifying  data  such  as  date,  document  title,  metadata  file,  etc.  Once  the 
user  had  identified  a  file  of  interest,  the  user  can  select  the  filename  and 
dowload  the  file. 

c.  Submit  data :  This  option  allows  the  user  to  submit  a  file  to  the  repository. 

d.  Download  data :  This  option  provides  the  users  with  access  to  the  file 
directories  and  subdirectories  where  a  specific  file  can  be  selected  for 
downloading. 

e.  File  format  This  page  will  list  file  formats  acceptable  to  the  repository. 

f  File  Help  page :  This  page  will  include  a  help  section  that  provides 
additional  information  about  procedures  and  restrictions  associated  with 
the  repository. 

g.  Links:  The  file  repository  shall  provide  links  and  compatibility  with 
regional  or  other  repository  sites. 

Preliminary  web  pages  are  presented  in  Appendix  A. 
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